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Abstract. Most networks found in social and biochemical systems have modular structures. An important 
question prompted by the modularity of these networks is whether nodes can be said to belong to a single 
group. If they cannot, we would need to consider the role of "overlapping communities." Despite some 
efforts in this direction, the problem of detecting overlapping groups remains unsolved because there is 
neither a formal definition of overlapping community, nor an ensemble of networks with which to test 
the performance of group detection algorithms when nodes can belong to more than one group. Here, 
we introduce an ensemble of networks with overlapping groups. We then apply three group identification 
methods — modularity maximization, fc-clique percolation, and modularity-landscape surveying — to these 
networks. We find that the modularity-landscape surveying method is the only one able to detect het- 
erogeneities in node memberships, and that those heterogeneities are only detectable when the overlap is 
small. Surprisingly, we find that the fc-clique percolation method is unable to detect node membership for 
the overlapping case. 
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1 Introduction 



Real- world networks including man-made and natural net- 
works are strongly modular, that is, the pattern of connec- 
tions among nodes is not homogeneous [HI]. The modu- 
larity of a network is a consequence of the fact that there 
are groups of nodes in the network that preferentially con- 
nect to one another [TK^KSUHISliniEllH] . However, the assign- 
ment of nodes into those groups still remains a challenging 
task because, typically, nodes also connect to nodes that 
are not in their group [SUnilH]- Additionally, nodes may 
hold membership in more than one group pi8l[9l fT0l[TT] . 
resulting in groups that "overlap." 

The question of whether there are nodes that belong 
to more than one group and how important overlapping 
groups are to the network's organization is especially rele- 
vant in social and biochemical systems, in which typically 
nodes are thought to belong to more than one group. 
Consider, for instance, the network of scientific collabo- 
rations within an institution: People with joint appoint- 
ments would be expected to appear in more than one 
group. Or, consider the network of physical interactions 
between proteins: Topological modules are thought to have 
a strong correlation with biological function [12]. Since 
many proteins are known to have more than one function, 



one would expect these proteins to belong to more than 
one group. 

Recently, methods to uncover the hierarchical organi- 
zation of networks |13[|14j have been proposed, opening 
the possibility of performing multi-scale analysis on so- 
cial, biological, and economical systems for which large 
amounts of data are available. However, a potential caveat 
of these methods is that they do not take into account the 
fact that nodes could hold membership in more than one 
group. Modularity maximization methods [7|fT5l[T6]. which 
have been successful at finding correlations between net- 
work function and structure, suffer from the same prob- 
lem. The impact of neglecting overlapping groups has not 
been assessed so far, since there is neither a formal defi- 
nition of overlapping group nor a set of models on which 
to test overlap identification algorithms. 

Here, we introduce an ensemble of networks [17j that 
have overlapping groups by construction [IT] . We then ap- 
ply three different group detection methods — modularity 
maximization p^lfTS] , A;-clique percolation [8] , and modularity- 
landscape surveying [T3] — to these networks. We find that 
the modularity-landscape surveying method is the only 
one able to detect heterogeneities in node memberships, 
and that these heterogeneities are detectable provided the 
overlap is significantly smaller than the size of the modules 
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2 Model networks 

Consider a network comprised of N nodes and L edges. 
Let TV = { rii : i — l,...,iV} be the set of nodes and 
G — { gx X — a,b, . . . } he the set of groups in which 
the nodes can hold membership. Specifically, let Qi C G 
be the set of groups in which node rii holds membership. 
Without loss of generality, we assume that all groups with 
identical membership lists have been merged and that all 
groups have at least two members. 

Here, we focus on the ensemble of random networks 
in which the probability pij of the edge (rii^nj) being 
present in a network is a function solely of the set of co- 
memberships of the two nodes A4ij — Gi^ Gj- We restrict 
our attention to the case where pij is a non-decreasing 
function of the cardinality of A4ij. For undirected net- 
works, this is the most plausible case. Indeed, most clus- 
tering algorithms used to investigate the modular struc- 
ture of networks have at their basis the assumption that 
this is the appropriate case. Such an assumption is sup- 
ported by the fact that those methods return plausible 
results for those networks [7l[T7 l fT8 l fT9 j l20j . 

We consider the case where pij = pi, where I = \ 
is the cardinality of , and, further, we assume that 
Po < Pi < P2 < ■ • •■ An implication of our choice for pij is 
that if one selects a sub-set of Af in which all pairs of nodes 
have non-empty co-membership sets, then there will be 
more edges connecting these nodes than one would expect 
to find by chance. Or, in other words, we expect to find 
more edges connecting these nodes than if all links had 
the same probability p = 2L/N{N — 1) of being present. 
In contrast, if one selects a sub-set of Af in which all pairs 
of nodes have empty co-membership sets, then there will 
be fewer edges connecting these nodes than one would 
expect to find by chance. These facts directly suggest that 
the maximization of a modularity function such as that 
proposed by Newman and Girvan 1 will enable one to 
identify node membership in modular networks (see [13|, 
[2TII22] for caveats to this argument). 

The ensemble of networks we focus on comprises two 
distinct sub-ensembles. The first sub-ensemble, which we 
denote transitive, conforms to a transitive relationship 
among co-membership sets. That is, if the co-membership 
set Mij is non-empty and the co-membership set Mik is 
also non-empty, then the co-membership set Aijk must 
also be non-empty. The second sub-ensemble, which we 
denote non-transitive, does not conform to a transitive 
relationship among co-membership sets. 

Networks in the transitive sub-ensemble have the prop- 
erty that every node must hold membership in only one 
group (if identical groups have been collapsed). In con- 
trast, networks in the non-transitive sub-ensemble have 
some nodes that hold membership in more than one group. 
Most module detection algorithms in the literature deal 
only with the transitive sub-ensemble tLil5,20,23j, that 
is, nodes are divided into "exclusive groups." A notable 
exception is the work of Palla et al. [5], which highlights 
the possibility that a network will contain nodes belonging 



communities." Regretfully, Palla et al. [8 do not define 
ensembles of networks with overlapping groups. 

Another significant exception is the work of Sales-Pardo 
et al. [IBJ, which determines community structure even 
when hierarchical levels of structure exist. These hierar- 
chical levels of structure indicate that nodes may belong 
to more than one group, but only when multiple layers 
are considered. Here, we define an ensemble of networks 
in which most nodes hold membership in a single group, 
while a small fraction of nodes hold membership in two or 
three groups (Fig. [1]). 

3 Community detection 
3.1 Description of the methods 

Let us now address the question of detectability of the 
memberships of individual nodes. Ideally, one wishes to 
detect all group memberships from the topology of the 
network alone. For the case of transitive networks, it has 
already been shown that when pi , the probability that two 
nodes belonging to the same group are connected, is not 
much larger than po, it is impossible to extract the correct 
membership assignment from the network structure alone 
^15]. Here, we focus on the detection of node member- 
ship for the ensemble of non-transitive networks described 
above. We consider three different classes of group detec- 
tion algorithms: modularity maximization [TlllSj. fc-clique 
percolation and modularity-landscape surveying |13| . 

Modularity maximization methods are the current "gold 
standard" for group identification ^23] . In this approach, 
nodes are classified into groups that maximize the num- 
ber of within group edges compared to those that would 
be expected from chance alone fUEUUISllTj . Some of the 
proposed algorithms, such as spectral decomposition, are 
extremely fast and can handle networks comprised of hun- 
dreds of thousands of nodes [7]. However, this approach is 
clearly geared toward networks with transitive member- 
ship structures since every node must be classified into a 
single group. 

The fc-clique percolation method introduced by Palla 
et al. Sj is based on the observation that networks some- 
times contain connected cliques of the same size [5]. In 
this method, a group comprises chains of "adjacent" k- 
cliques — where two fc-cliques are adjacent if they share 
k — 1 nodes. A strength of this approach is that nodes 
can be classified into more than one group, making it a 
priori well-suited to investigate non-transitive networks. 
Two limitations of this this approach, however, are that 
different values of k will result in different group member- 
ship patterns, and that sparse networks might contain a 
very small number of cliques with k > 2. 

The modularity-landscape surveying method 13J, or 
MLS, is based upon the observation that the modular- 
ity landscape is very rugged and has many local max- 
ima, which means that there are many partitions of nodes 
into groups characterized by high values of the modularity 
function. In analogy to disordered physical systems whose 
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the set of all local maxima conveys the relevant contribu- 
tion to the system's physical properties. Thus, the method 
samples all the partitions V corresponding to local max- 
ima with probabilities proportional to the size of their 
basins of attraction. Then, it builds a co-classification ma- 
trix A, in which each element Ay corresponds to the ex- 
pected fraction of the time in which a pair of nodes (n^ , n,j ) 
is classified in the same group. As discussed above, this 
method does not restrict nodes to hold membership in a 
single group, but rather, by collecting statistics, it reports 
the likelihood that two nodes are members of the same 
group (or sets of groups), and therefore it is a priori suit- 
able to identify node memberships in networks with both 
transitive and non-transitive memberships. 



where P is the list of groups in the first partition, Q is 
the list of groups in the second partition, N is the total 
number of nodes, Ni is the number of nodes in group gi in 
the first partition, Nj is the number of nodes in gj in the 
second partition, and Nij is the number of nodes that are 
both in gi and gj. Note that this expression is symmetric; 
thus, it is an unbiased metric to compare the similarity of 
two partitions. 

If the partitions are identical, M/ = 1, whereas if the 
two partitions are totally uncorrelated. Mi = 0. Note, 
however, that for the case in which each node is placed 
into a separate group, one has Mi — Mj > 0. We thus 

report to/ = ^^'^i^^' , so that values of m/ greater than 

zero indicate significant accuracy. 



3.2 Random network ensembles 

In order to investigate the detectability of the membership 
structure of a network, we generate random networks in 
which nodes can belong to five groups, G = {ga, gt, gc, gd, <7e} 
We consider the cases in which most of the nodes belong to 
a single group, {ga}, {gb}, {gc}, or {gd}, and the remaining 
nodes belong to two groups, {ga,gb}, or to three groups, 
{ffa, gb, 5e }■ We then assume that if two nodes have mem- 
bership in the same group, they will be linked with proba- 
bility pi > po- Similarly, if two nodes have membership in 
the same two groups, they will be connected with proba- 
bility P2 > Pi - Note that as r = Po/pi approaches one, the 
harder it becomes to detect the co-membership structure 
of a network. Also, since some nodes belong to multiple 
groups, the degree of these overlapping nodes will be larger 
than that for a node belonging to only one group. Hence- 
forth, we denote the average degree of a node belonging 
to only one group by z. 

If a node holds membership in only one group, say ga, 
then it belongs to group A. If some nodes hold membership 
in two groups {ga, gb}, then they belong to groups A and 
B. If the latter case is true, then groups A and B overlap. 
We define the "overlap size" s as 



that is, the number of nodes in both A and B divided by 
the combined size of groups A and B. Thus, an important 
issue regarding co-membership detection when nodes hold 
membership in more than one group is how the size of the 
overlap affects the accuracy in detecting group member- 
ship. 

3.3 Mutual information 



4 Results 

To compare the performance of the methods for the en- 
semble of model networks with overlapping groups previ- 
ously introduced, we generate ten networks for each set 
of parameter values and apply the three group detection 
algorithms to each network. Figure [2] displays the typical 
results obtained for a network with parameters r = 0.125, 
z = 16, p2 = Pi or p2 = 2pi, and s = 0.125 or s = 0.25. 

To determine the accuracy of each method, we com- 
pare the partitions returned by each method to the known 
division of nodes into groups. Specifically, we use the nor- 
malized mutual information to/, which quantifies the amount 
of information that two different partitions share [23j . Fig- 
ure [3] displays the average to/ versus z for p2 = pi and 
s — 0.125 or s = 0.25 and different values of r. 

Since the average degree of a node should strongly af- 
fect the ability of each group detection method to detect 
the known group structure, we systematically investigate 
degree effects. We expect that, as degree increases, the 
difficulty of detection should decrease. Also, as the num- 
ber of nodes having membership to two or more groups 
increases, the difficulty of detection should increase. 

4.1 Modularity maximization 

The results obtained with the spectral decomposition method 
exhibit different behaviors depending on the size of the 
overlap: for small overlaps (s = 0.125), the method iden- 
tifies four groups, whereas for large overlaps (s = 0.25), 
it identifies three groups, such that the two overlapping 
groups are combined into a single one, as predicted by For- 
tunate and Barthclcmy 21 . Note that there are no signif- 
icant differences between the cases p2 = pi and p2 — 2pi . 



To quantify the similarity between two partitions of nodes, 
we calculate the mutual information between the two par- 
titions [231: 



M, 



NiNj 



(2) 



4.2 Modularity-landscape surveying 

In contrast, the modularity-landscape surveying method 
is able to uncover more information about the underlying 
organization of the nodes in the network than either the 
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Even for small overlaps, the algorithm is able not only to 
identify densely interconnected groups of nodes, but is also 
able to detect that the overlapping groups have more in 
common with each other than with the remaining groups 
(Fig.©. 



collection of edges alone. The signal is only distinct for 
s < 0.1, and even then it is not clear whether one can 
distinguish between the case in which two groups overlap 
and the case in which a group comprises two sub-groups 
(Fig. [3 [13]. 



4.3 A;-clique percolation 

The results obtained with the fc-clique method depend 
strongly on the value of k. For fc = 3, the method is unable 
to detect the modular structure of the networks; it places 
all the nodes into a single group. For fc = 4, the two over- 
lapping groups are mostly combined into one group for 
both large and small overlaps. Finally, for fc = 5, the al- 
gorithm does not identify any sizable group of nodes in 
the network. In fact, the signal provided by the fc-clique 
method is weaker than that provided by the adjacency 
matrix. Surprisingly, even though the fc-clique method al- 
lows nodes to belong to more than one group, the nodes 
placed in multiple groups do not in general correspond to 
the nodes belonging to the overlapping groups (Fig. [2]). 

We find that the accuracy of the fc-clique method is 
always much smaller than that of the spectral decomposi- 
tion and modularity-landscape surveying methods. In fact, 
in order for the fc-clique method to return results that are 
significant, one must have r < 0.25. Moreover, for fc = 4, 
the accuracy of the method decreases as the density of 
edges increases. 

For low edge densities, the network does not contain 
any 5-cliques, so the groups identified for fc = 5 are un- 
reliable. These results point to a severe limitation of the 
fc-clique method: similar networks require different fc val- 
ues in order to yield meaningful results, and even when 
group detection is meaningful the method always performs 
significantly worse than modularity based methods. 

4.4 Overlap detectability 

These results suggest that the detection of overlapping 
groups may be essentially impossible when the overlap is 
large. However, for small overlaps, the modularity-landscape 
surveying method is able to detect heterogeneities in node 
group membership. The question that arises is thus how 
small should the overlap be in order to be detected and 
whether detection may ever be unambiguous. To answer 
this question, we analyze model networks with groups 
comprising 100 nodes, z — IQ, r — 0.1, and p2 = 2pi, for a 
wide range of overlap sizes s — 0.02, . . . , 0.25 (Figs. [4] and 
[6|) . The spectral decomposition method shows a transition 
from identifying four groups (s < 0.175) to identifying 
three groups (s > 0.2). For s < 0.175, the modularity- 
landscape surveying method is able to detect the signa- 
ture of heterogeneities in node membership. However, the 
"signal" fades as s increases. 

Like the spectral decomposition method, the modularity- 
landscape surveying method also indicates that there are 
three different groups for s > 0.2. Additionally, it is im- 



5 Conclusions 

The ability to detect overlapping communities within real- 
world networks would greatly enhance understanding of 
phenomena such as synchronization |27j . However, our 
analysis reveals that the group detection methods in the 
literature are not entirely equipped to handle such infor- 
mation. In some cases, these methods may require tunable 
parameters, such as in the k-clique percolation method [8] 
and the method of Gfeller et al. [10] . The promising method 
of Nepusz et al. [11 aims to obtain the global organi- 
zation of a network while determining which nodes act 
as "bridges" between communities. This method captures 
some of the same information as the modularity-landscape 
surveying method, but requires additional centrality cal- 
culations to correct for the "bridgeness" score. Further- 
more, even the recently proposed modularity-landscape 
surveying method, which can detect small overlaps, is not 
able to unambiguously differentiate overlapping groups 
from hierarchically-organized groups. 
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Appendix A 

To characterize how the degree and the size of the overlap 
affect group detection, we generate forty networks com- 
posed of four groups of 100 nodes each for every set of con- 
ditions tested. We tested the degree of a non-overlapping 
node at values of z = 10,20,30,40,50, and we tested the 
size of the overlap, s, for s = 0.16 through s = 0.25. Each 
of these networks was generated with p2 = 2pi , except for 
the case z = 20, for which we also studied networks with 

Since the spectral decomposition method is very fast 
and is considered the gold standard for group detection, 
we applied this method on each of the networks generated 
(Fig. [6^). We expected the spectral decomposition method 
to detect four groups for low overlap sizes for every degree 
tested, and that it would detect three groups at higher 
overlap sizes or at higher node degree. We expected also 
that the higher the degree, the faster the transition be- 
tween four detectable groups and three detectable groups. 
Since some of the groups reported by the method were 
very small, we calculated the effective number of groups, 

A^cffcctivc. 
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where G is the Hst of groups in the partition of the network 
returned by the method, Si is the number of nodes in a 
group within Q, and N is the total number of nodes in 
the network. For the networks with an average degree of 
z = 20 and p2 — Pi, we find a transition from detecting 
four groups to detecting three groups for s « 0.18, while 
for z = 50, the transition occurs by s = 0.16. 

However, we wanted to further investigate the detectabil- 
ity for the degrees and overlap sizes chosen. Specifically, 
we examine the detection resolution limits as outlined by 
Fortunato and Barthelemy in |,21j . In their paper, Fortu- 
nato and Barthelemy indicate that in order for a group to 
be unambiguously detectable by spectral decomposition, 
it must meet two criteria: (i) the number of links within 
group g, or Ig, should be less than the total number of 
links L divided by four, or < j; and (ii) the ratio of 
links leaving the module, i™*, to links within the module 

should be less than two, or a — —, — < 2. All of the de- 
tected groups for each of the networks satisfied the second 
condition. Figure [5] shows 4 * ^ versus s for each degree. 
In this figure, a value less than one indicates that the res- 
olution detection limit is satisfied. Comparing Figures [S^ 
and b, we see that as soon as the resolution detection 
limit is violated, the number of detected groups decreases, 
so our results are consistent with those of Fortunato and 
Barthelemy [21]. 
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Fig. 1. (Color online) Random network models with overlapping groups. Consider a network with eight nodes, ni through ng. 
We assign these nodes to three different groups which we indicate by three different colors, blue, yellow, and purple. Moreover, 
we allow the blue and yellow groups to overlap; nodes ns and 714 belong to both yellow and blue groups. We highlight these 
overlapping nodes in green, (a), Purple nodes connect to other purple nodes with probability pi and to blue, green, and yellow 
nodes with probability po < pi. (b). Blue nodes connect with other nodes in the blue group (ni, 712, ns, 714) with probability 
pi, and to nodes outside their group (yellow and purple nodes) with probability po- Note that yellow nodes have the same 
properties as blue nodes, (c), Green nodes connect to blue and yellow nodes with the same probability pi. Since green nodes 
are not in the same group as purple nodes, the probability of having a connection between a green and purple node is po- For 
generality, we assume that green nodes connect between themselves with probability p2. (d). Adjacency matrices for the model 
networks with p2 = pi and p2 — 2p\. We show both the adjacency matrix for a single realization of the model network and 
the average adjacency matrix, that is the fraction of times two nodes are connected in the ensemble of networks defined by the 
model. Note that p2 — 2pi has an overlapping region that is more densely connected, hence the average adjacency matrix has 
more "black." We show results for networks of 112 nodes divided into four groups of 32 nodes. Two of the four groups overlap 
by sharing 16 nodes and the corresponding edges. We select pi and po such that for nodes not in the overlapping region, the 
average degree z is 16 and the ratio r between the external-degree and internal-degree is r = 0.125. 
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Fig. 2. (Color online) Output of methods for group detection. We apply three different group detection algorithms, the 
spectral decomposition method, the modularity-landscape surveying method, and the A:-clique method to model networks with 
overlapping groups (Fig. 1). We show results obtained for a typical network realization for p2 = pi for two overlap sizes: large 
overlap (s = 0.25 — first row), and small overlap (s — 0.125 — second row). We generate these networks with the same parameters 
z and r that we use for the networks in Fig. 1. The first column shows the adjacency matrix. The second column shows the co- 
classification matrix obtained with the modularity- landscape surveying method (see text). The third column displays the group 
classification matrix obtained with the spectral decomposition method. The next two columns show the group classification 
matrices obtained with the fc-clique method for fe = 4 and k — 5. The final column shows the expected group classification. 
The colorbar is the same as in Fig. [1] In the group classification matrices, each row corresponds to a node, and each column 
corresponds to a group. If a node belongs to the group, a black band appears in that group's column. If a node belongs to 
more than one group, multiple bands will appear; these are nodes that the belong to the "overlap" between groups according 
to the algorithm. Dotted black lines indicate the nodes that by construction hold membership in more than one group and 
thus comprise the overlap in the model network. The modularity-landscape surveying method indicates a region of membership 
heterogeneity such that nodes in the overlapping groups have more in common with each other than with nodes in the non- 
overlapping groups (grey region), but this effect is dependent on the value of s. The spectral decomposition method yields three 
or four groups, depending on the value of s. The fc-clique method yields at least four groups for k > 4, some of which only 
contain a few nodes. 
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Fig. 3. (Color online) Performance of modularity and A;-clique methods for group detection. We generate ten networks for 
each set of parameter values, z and r. We then compare the groups we obtain from the fc-clique, spectral decomposition, and 
modularity- landscape surveying methods to the expected group assignment and compute mi (see text). We show the normalized 
mutual information, mj, versus average node degree z for networks with p2 = Pi and a large overlap (first row) and a small 
overlap (second row) (Fig. For the modularity- landscape surveying method, we computed mi with regards to the topmost 
level only. The first column of plots corresponds to r = 0.1, the second column to r = 0.125, and the third column to r = 0.25. 
Note that mi > indicates that the method considered yields significant information on the network's group structure. For 
the fc-clique method, mi < indicates either the lack of detectable cliques of that size or the tendency to put all nodes in 
the same group. Note that, for the spectral decomposition and modularity-landscape surveying methods, performance increases 
with increasing average node degree z and decreasing r. 
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Fig. 4. (Color online) Effect of overlap size. We plot the average number of groups detected using the spectral decomposition 
method for modularity maximization (black circles) versus the overlap size s (Eq. ^) for model networks with groups of 100 
nodes, z — 50, r = 0.1, and p2 = 2pi. We show the average and standard error obtained for forty networks. We also show the 
co-classification matrix A obtained from the modularity-landscape surveying method for a typical network for four different s 
values. In the co-classification matrices, each row/column represents a node and each element Aij corresponds to the fraction of 
the time two nodes are classified in the same partition for the maxima in the modularity landscape (see text). Each element is 
colored following the color code on the right hand side. To illustrate the differences for co-classification values close to one, we 
also plot log(l — Aij). Again, matrix elements are colored following the color code on the right hand side. Note that while the 
spectral decomposition method can only detect whether the overlap is large enough to transition between four to three groups, 
the modularity-landscape surveying method is able to capture the greater affinity between the nodes in two of the groups in 
the organization of the network. 
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Fig. 5. (Color online) Comparison between overlapping networks and networks with a hierarchical structure. We compare results 
from applying the three different methods (spectral decomposition, fc-clique percolation, and modularity-landscape surveying) 
to (i) networks with a small overlap, and (ii) networks with a hierarchical structure. We show typical results for (i) a network 
with overlapping groups of 100 nodes, z = 16, r = 0.1, p2 — 2 pi, and s = 0.1 (top row — see Fig. 1 for details); (ii) a network 
with hierarchical structure such that at the top level there are three groups (two of 100 nodes and one of 200 nodes), and the 
largest group is comprised of two sub-groups of 100 nodes. We construct the latter network such that z — 16 and r — 0.1, for 
nodes in the groups with flat organization. For edges involving nodes inside the large group, we link nodes with probability: 
(i) pi if they belong to the same sub-group, (ii) with p2 < Pi if they do not belong to the same sub-group, but belong to the 
same group at the top level; and, (iii) with po if they do not belong to the same group at the top level. Note that po is the 
same for any pair of edges running across groups, and that pi and p2 are selected such that z = 16. We show results for the 
case r2 = P2/P1 = 1/3. are very few differences in the results for both networks. The spectral decomposition method finds four 
groups for both cases. The fc-clique percolation method with k — 4 subtly outlines the underlying group structure of the network 
shown by the adjacency matrix. Results for k = 5 are not shown because there are no cliques of that size for sparse networks. 
Note that the signal detected by the fc-clique percolation method has significantly decreased compared to that for the smaller 
networks in Fig. (2). In contrast, the signal detected by the modularity-landscape surveying method has not decreased. Note 
that results for hierarchical and overlapping networks are very similar. 
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Fig. 6. (Color online) (a), This graph shows the average number of effective groups detected by spectral decomposition for a 
range of overlap sizes and for different average degrees. The average degrees investigated were z — 10, 20, 30, 40, 50. Forty 
networks were constructed for all cases except the z = 50 case, which only had ten networks for every overlap size. Each network 
consists of four groups, two of which share nodes. As the network becomes more dense with increasing degree, the transition 
from the detection of approximately four groups to three groups occurs abruptly at an overlap size of s = 0.16. (b), This graph 
shows the resolution of limit detectability. In the paper by Fortunato and Barthelemy, they determined that for a module to be 
detectable via spectral decomposition, the number of links within the module, Ig, should be less than j where L is the total 
number of links in the network. We tested these conditions for several networks with overlapping groups by construction. The 
size of the overlap varied from s = 0.16 to s = 0.25. Finally, forty networks were generated for each set of parameters except 
for 2 = 50, in which ten networks were generated. In this case, we plot 4j- for each overlap size and degree, and see that the 
transition of detectability occurs over the same range as is indicated in the average number of effective groups. The transition 
sites are circled. 



